# Mathematical Reasoning Optimization

## Phi 4 Reasoning Plus GGUF
License: MIT · Author: unsloth · Downloads: 109.62k · Likes: 47
Tags: Large Language Model, Supports Multiple Languages

Phi-4-reasoning-plus is an open-source reasoning model developed by Microsoft Research, focusing on advanced reasoning capabilities in mathematics, science, and programming.
## Microsoft Phi 4 Reasoning GGUF
License: MIT · Author: bartowski · Downloads: 5,443 · Likes: 4
Tags: Large Language Model

A quantized version of Microsoft's Phi-4-reasoning model, converted with llama.cpp for local inference and offered in multiple quantization options (a llama.cpp loading sketch follows this entry).
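
GGUF quantizations like this one are commonly run through llama.cpp or its Python bindings. The following is a minimal sketch using llama-cpp-python; the model file name, context size, and sampling settings are assumptions for illustration, not values taken from this listing.

```python
# Minimal sketch: running a GGUF quantization with llama-cpp-python.
# The file name below is an assumption; use whichever quantization level
# (e.g. Q4_K_M, Q8_0) you actually downloaded from the listing.
from llama_cpp import Llama

llm = Llama(
    model_path="Phi-4-reasoning-Q4_K_M.gguf",  # hypothetical local file
    n_ctx=4096,        # context window to allocate
    n_gpu_layers=-1,   # offload all layers to GPU if one is available
)

result = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Prove that the sum of two even numbers is even."}],
    max_tokens=512,
    temperature=0.6,
)
print(result["choices"][0]["message"]["content"])
```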
## Mimo 7B SFT
License: MIT · Author: XiaomiMiMo · Downloads: 1,183 · Likes: 23
Tags: Large Language Model, Transformers

MiMo-7B-RL is a reinforcement learning model trained from the MiMo-7B-SFT model listed here, achieving performance comparable to OpenAI o1-mini on mathematical and code reasoning tasks.
## Tngtech.olmo 2 Instruct Math 32B GGUF
Author: DevQuasar · Downloads: 272 · Likes: 1
Tags: Large Language Model

OLMo-2-Instruct-Math-32B is a large language model focused on mathematical tasks, released by tngtech.
## Openmath Nemotron 1.5B
Author: nvidia · Downloads: 493 · Likes: 14
Tags: Large Language Model, Transformers, English

OpenMath-Nemotron-1.5B is a mathematical reasoning model based on Qwen2.5-Math-1.5B and fine-tuned on the OpenMathReasoning dataset, achieving state-of-the-art results on multiple mathematical benchmarks.
## Zero Mistral 24B
License: MIT · Author: ZeroAgency · Downloads: 41 · Likes: 2
Tags: Large Language Model, Transformers, Supports Multiple Languages

Zero-Mistral-24B is an improved text-only model based on Mistral-Small-3.1-24B-Instruct-2503, primarily adapted for Russian and English, with the original visual capabilities removed to focus on text generation tasks.
## Openmath2 Llama3.1 8B
Author: nvidia · Downloads: 930 · Likes: 30
Tags: Large Language Model, Transformers, English

OpenMath2-Llama3.1-8B is a math-specialized model fine-tuned from the Llama3.1-8B-Base model using the OpenMathInstruct-2 dataset, demonstrating excellent performance across multiple mathematical benchmarks.
## Viper Coder V1.7 Vsm6
License: Apache-2.0 · Author: prithivMLmods · Downloads: 491 · Likes: 5
Tags: Large Language Model, Transformers, Supports Multiple Languages

Viper-Coder-v1.7-Vsm6 is a large language model based on the Qwen2.5 14B model architecture, focused on improving coding efficiency and computational reasoning, optimizing memory usage, and reducing redundant text generation.
## Phi 4 Reasoning Plus
License: MIT · Author: microsoft · Downloads: 19.83k · Likes: 261
Tags: Large Language Model, Transformers, Supports Multiple Languages

Phi-4-reasoning-plus is an advanced open-weight reasoning model from Microsoft Research, built on Phi-4 and further optimized through supervised fine-tuning and reinforcement learning, focusing on advanced reasoning in mathematics, science, and coding (a transformers loading sketch follows this entry).
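
Hugging Face-hosted models in this list can typically be loaded with the transformers library. Below is a minimal sketch; the repository id microsoft/Phi-4-reasoning-plus and the generation settings are assumptions based on this listing rather than instructions from the model card.

```python
# Minimal sketch: loading a hosted reasoning model with transformers.
# The repository id and generation settings are assumptions for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-4-reasoning-plus"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # reduce memory on GPUs that support bf16
    device_map="auto",           # place layers across available devices
)

messages = [{"role": "user", "content": "What is the derivative of x^3 * ln(x)?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=1024, temperature=0.7, do_sample=True)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```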
## EZO2.5 Gemma 3 12b It Preview
Author: AXCXEPT · Downloads: 39 · Likes: 1
Tags: Large Language Model, Transformers, Japanese

A text generation model based on google/gemma-3-12b-it, enhanced with the EZO training method to improve Japanese performance.
## Notbad V1 1 Mistral 24b
License: Apache-2.0 · Author: notbadai · Downloads: 34 · Likes: 4
Tags: Large Language Model, Transformers

A 24B-parameter large language model based on the Mistral architecture, trained for mathematical reasoning and Python programming.
## Openrs3 GRPO Ja
Author: EQUES · Downloads: 25 · Likes: 3
Tags: Large Language Model, Transformers

OpenRS3-GRPO-ja is a version of SakanaAI/TinySwallow-1.5B-Instruct fine-tuned on a Japanese mathematical instruction dataset using the GRPO method, focusing on mathematical reasoning tasks (a GRPO training sketch follows this entry).
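
GRPO (Group Relative Policy Optimization) is a reinforcement-learning method popularized by R1-style reasoning training. The listing does not include the actual recipe, so the following is only a minimal sketch of GRPO fine-tuning with the trl library under assumed settings; the toy dataset, reward function, and hyperparameters are illustrative, not EQUES's setup.

```python
# Minimal GRPO sketch with trl (assumed recipe, not the actual OpenRS3-GRPO-ja setup).
from datasets import Dataset
from trl import GRPOConfig, GRPOTrainer

# Toy prompt dataset; a real run would use a Japanese math instruction dataset.
dataset = Dataset.from_dict({
    "prompt": [
        "12 + 35 を計算してください。",
        "7 × 8 はいくつですか?",
    ]
})

def numeric_answer_reward(completions, **kwargs):
    """Toy reward: favor completions that state at least one digit."""
    return [1.0 if any(ch.isdigit() for ch in c) else 0.0 for c in completions]

args = GRPOConfig(
    output_dir="openrs3-grpo-sketch",
    per_device_train_batch_size=4,  # must be divisible by num_generations
    num_generations=4,              # completions sampled per prompt
    max_completion_length=128,
)
trainer = GRPOTrainer(
    model="SakanaAI/TinySwallow-1.5B-Instruct",  # base model named in the listing
    reward_funcs=numeric_answer_reward,
    args=args,
    train_dataset=dataset,
)
trainer.train()
```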
## Notbad V1 0 Mistral 24b
License: Apache-2.0 · Author: notbadai · Downloads: 29 · Likes: 5
Tags: Large Language Model, Transformers

Notbad v1.0 Mistral 24B is a model focused on mathematical and Python programming reasoning, based on Mistral-Small-24B-Instruct-2501 and further trained with reinforcement learning.
## EXAONE Deep 7.8B GGUF
License: Other · Author: QuantFactory · Downloads: 297 · Likes: 3
Tags: Large Language Model, Supports Multiple Languages

The EXAONE Deep series models excel in reasoning tasks such as mathematics and programming. The 7.8B version outperforms open-source models of similar scale and even surpasses certain proprietary models.
## Olmo 2 Instruct Math 32B
License: Apache-2.0 · Author: tngtech · Downloads: 96 · Likes: 5
Tags: Large Language Model, Transformers, English

Based on the OLMo-2-0325-32B-Instruct model and fine-tuned on the Open R1 math dataset using AMD MI300X GPUs, this model focuses on strengthening mathematical reasoning capabilities.
## Fastcurl 1.5B Preview
License: MIT · Author: Nickyang · Downloads: 779 · Likes: 7
Tags: Large Language Model, Transformers, English

FastCuRL-1.5B-Preview is a slow-thinking reasoning model trained with curriculum-guided, iteratively extended reinforcement learning, and it excels at mathematical reasoning tasks.
## Yixin Distill Qwen 72B 4.5bpw H6 Exl2
License: Apache-2.0 · Author: LoneStriker · Downloads: 37 · Likes: 3
Tags: Large Language Model, Supports Multiple Languages

A high-performance model for mathematical reasoning and general knowledge, distilled from Qwen2.5-72B through reinforcement learning; this entry is a 4.5 bits-per-weight EXL2 quantization.
## Gemma 3 4b Reasoning
License: Apache-2.0 · Author: ericrisco · Downloads: 53 · Likes: 2
Tags: Large Language Model, Transformers, English

Gemma-3-4b Reasoning is a Transformer-based language model fine-tuned using the GRPO method, specializing in reasoning task optimization.
## Yixin Distill Qwen 72B
License: Apache-2.0 · Author: YiXin-AILab · Downloads: 38 · Likes: 26
Tags: Large Language Model, Safetensors, Supports Multiple Languages

A high-performance distilled model optimized for mathematics and general reasoning, refined from Qwen2.5-72B through reinforcement learning.
## Qwen 2.5 7B Reasoning
License: MIT · Author: HyperX-Sen · Downloads: 70 · Likes: 3
Tags: Large Language Model, Transformers, English

A fine-tuned version of Qwen/Qwen2.5-7B-Instruct, specifically optimized for advanced reasoning tasks.
## Sombrero Opus 14B Sm5
License: Apache-2.0 · Author: prithivMLmods · Downloads: 43 · Likes: 2
Tags: Large Language Model, Transformers, Supports Multiple Languages

Built on the Qwen 2.5 14B model architecture, designed to enhance coding efficiency and computational reasoning capabilities.
## Tinyr1 32B Preview
License: Apache-2.0 · Author: qihoo360 · Downloads: 3,292 · Likes: 327
Tags: Large Language Model, Transformers

Tiny-R1-32B-Preview is a reasoning model based on DeepSeek-R1-Distill-Qwen-32B, focusing on mathematics, coding, and science, with performance close to the full R1 model.
## Mistral Small 24B Instruct 2501 Reasoning
License: Apache-2.0 · Author: yentinglin · Downloads: 1,689 · Likes: 54
Tags: Large Language Model, Safetensors, English

A model fine-tuned from Mistral-Small-24B-Instruct-2501 and optimized for mathematical reasoning.
## Sky T1 32B Flash
License: Apache-2.0 · Author: NovaSky-AI · Downloads: 557 · Likes: 64
Tags: Large Language Model, Transformers, English

A preference-optimized version of the 32B reasoning model Sky-T1-32B-Preview that significantly reduces generation length while maintaining accuracy.
## Internlm3 8b Instruct Gguf
License: Apache-2.0 · Author: internlm · Downloads: 1,072 · Likes: 26
Tags: Large Language Model, English

The GGUF-format version of the InternLM3-8B-Instruct model, usable with the llama.cpp framework and available in multiple quantization variants.
## Tulu3
License: Other · Author: cortexso · Downloads: 226 · Likes: 1
Tags: Large Language Model

Tülu 3 is a new generation of instruction-following models developed by the Allen Institute for Artificial Intelligence, excelling in standard chat applications and complex problem solving.
## Rho Math 1b V0.1
License: MIT · Author: microsoft · Downloads: 1,451 · Likes: 15
Tags: Large Language Model, Transformers, English

Rho-1 is a math-specialized language model pretrained with the Selective Language Modeling (SLM) method, which significantly improves accuracy on mathematical problem solving (a sketch of the SLM loss follows this entry).
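
Selective Language Modeling, as described in the Rho-1 work, trains only on the tokens whose loss exceeds that of a reference model by the largest margin. The function below is a toy PyTorch sketch of that idea under assumed details (the keep ratio, tensor shapes, and random inputs are illustrative); it is not Rho-1's actual training code.

```python
# Toy sketch of a Selective Language Modeling (SLM) loss: keep only the tokens
# with the largest excess loss over a reference model. Assumed details, not Rho-1's code.
import torch
import torch.nn.functional as F

def slm_loss(student_logits, reference_logits, labels, keep_ratio=0.6):
    """Cross-entropy over the top-`keep_ratio` fraction of tokens by excess loss."""
    vocab = student_logits.size(-1)
    student_ce = F.cross_entropy(
        student_logits.view(-1, vocab), labels.view(-1), reduction="none"
    )
    with torch.no_grad():
        reference_ce = F.cross_entropy(
            reference_logits.view(-1, vocab), labels.view(-1), reduction="none"
        )
        excess = student_ce.detach() - reference_ce      # per-token excess loss
        k = max(1, int(keep_ratio * excess.numel()))     # number of tokens to keep
        selected = torch.topk(excess, k).indices         # indices of hardest tokens
    return student_ce[selected].mean()

if __name__ == "__main__":
    # Random tensors stand in for student/reference logits and labels.
    B, T, V = 2, 8, 50
    student = torch.randn(B, T, V, requires_grad=True)
    reference = torch.randn(B, T, V)
    labels = torch.randint(0, V, (B, T))
    loss = slm_loss(student, reference, labels)
    loss.backward()
    print(loss.item())
```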
## UNA SimpleSmaug 34b V1beta
License: Apache-2.0 · Author: fblgit · Downloads: 18 · Likes: 21
Tags: Large Language Model, Transformers

A supervised fine-tuned model based on Smaug-34B, focused on enhancing mathematical and reasoning capabilities, excelling among 34B-scale models.
## Westseverus 7B DPO V2
License: Apache-2.0 · Author: PetroGPT · Downloads: 437 · Likes: 8
Tags: Large Language Model, Transformers, English

WestSeverus-7B-DPO-v2 is a WestLake-family model trained from WestSeverus-7B on multiple DPO datasets, showing strong performance on basic mathematical problems.
## Neural Chat 7b V3 3
License: Apache-2.0 · Author: Intel · Downloads: 29.82k · Likes: 78
Tags: Large Language Model, Transformers

Neural-Chat-v3-3 is a 7-billion-parameter large language model developed by Intel on the Mistral-7B architecture, focused on mathematical reasoning and text generation. The model is fine-tuned on the MetaMathQA dataset and aligned with the Direct Preference Optimization (DPO) method (a DPO alignment sketch follows this entry).
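
The DPO alignment step mentioned above can be illustrated with the trl library. This is only a minimal sketch under assumed settings (a recent trl version, a toy in-memory preference dataset, a stand-in base checkpoint, default hyperparameters); it is not Intel's actual training recipe.

```python
# Minimal DPO sketch with trl; a toy illustration, not Intel's actual recipe.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

base_model = "mistralai/Mistral-7B-v0.1"  # stand-in base; swap for your own checkpoint
model = AutoModelForCausalLM.from_pretrained(base_model)
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token  # Mistral tokenizers ship without a pad token

# DPO expects preference pairs: a prompt plus a preferred and a rejected answer.
dataset = Dataset.from_dict({
    "prompt": ["What is 15% of 240?"],
    "chosen": ["15% of 240 is 0.15 * 240 = 36."],
    "rejected": ["15% of 240 is 24."],
})

args = DPOConfig(output_dir="neural-chat-dpo-sketch", beta=0.1, per_device_train_batch_size=1)
trainer = DPOTrainer(model=model, args=args, train_dataset=dataset, processing_class=tokenizer)
trainer.train()
```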